Incremental speech synthesis
نویسنده
چکیده
Human interaction with spoken dialogue systems differ in many ways from their interactions with each other. One notable example is that spoken dialogue systems tend to have a strict concept of turns which makes the dialogue more similar to a ping-pong game than to humans conversing. Given that we aim at creating spoken dialogue systems that can engage in human-like conversation (note that although this is the case for most dialogue work at KTH, it is not true by necessity; for a discussion see Edlund et al., 2008), this rigid dependence on turns needs a solution. The present paper discusses a step in that direction: the use of incremental varieties of speech synthesis. A brief background and discussion on incrementality in spoken dialogue systems is given, followed by a discussion of the specific requirements an incremental speech synthesis should meet, and a presentation of a prototype system meeting some of these requirements.
منابع مشابه
The INPROTK 2012 Release: A Toolkit for Incremental Spoken Dialogue Processing
We describe the 2012 release of INPROTK, our “Incremental Processing Toolkit” which combines a powerful and extensible architecture for incremental processing with components for incremental speech recognition and, new to this release, incremental speech synthesis. These components work domainindependently; we also provide example implementations of higher-level components such as natural langu...
متن کاملINPRO_iSS: A Component for Just-In-Time Incremental Speech Synthesis
We present a component for incremental speech synthesis (iSS) and a set of applications that demonstrate its capabilities. This component can be used to increase the responsivity and naturalness of spoken interactive systems. While iSS can show its full strength in systems that generate output incrementally, we also discuss how even otherwise unchanged systems may profit from its capabilities.
متن کاملEvaluating Prosodic Processing for Incremental Speech Synthesis
Incremental speech synthesis (iSS) accepts input and produces output in consecutive chunks that only together result in a full utterance. Systems that use iSS thus have the ability to adapt their utterances while they are ongoing. However, starting to process with less than the full utterance available prohibits global optimization, leading to potentially suboptimal solutions. In this paper, we...
متن کاملIncremental Dialogue Processing in a Micro-Domain
This paper describes a fully incremental dialogue system that can engage in dialogues in a simple domain, number dictation. Because it uses incremental speech recognition and prosodic analysis, the system can give rapid feedback as the user is speaking, with a very short latency of around 200ms. Because it uses incremental speech synthesis and self-monitoring, the system can react to feedback f...
متن کاملHmm-based Incremental Speech Synthesis
Incremental speech synthesis aims at delivering the synthetic voice while the sentence is still being typed. The main challenges are the online estimation of the target prosody from a partial knowledge of the sentence’s syntactic structure, the online phonetization and estimation of parts of speech and the timing of the delivery. This thesis aims at solving these challenges resulting in the imp...
متن کاملCombining Incremental Language Generation and Incremental Speech Synthesis for Adaptive Information Presentation
Participants in a conversation are normally receptive to their surroundings and their interlocutors, even while they are speaking and can, if necessary, adapt their ongoing utterance. Typical dialogue systems are not receptive and cannot adapt while uttering. We present combinable components for incremental natural language generation and incremental speech synthesis and demonstrate the flexibi...
متن کامل